Towards Finding Valuable Topics
نویسندگان
چکیده
Enterprises depend on their information workers finding valuable information to be productive. However, existing enterprise search and recommendation systems can exploit few studies on the correlation between information content and information workers’ productivity. In this paper, we combine content, social network and revenue analysis to identify computational metrics for finding valuable information content in people’s electronic communications within a large-scale enterprise. Specifically, we focus on two questions: (1) how are the topics extracted from such content correlate with information workers’ performance? and (2) how to find valuable topics with potentially high impact on employee performance? For the first question, we associate the topics with the corresponding workers’ productivity measured by the revenue they generate. This allows us to evaluate the topics’ influence on productivity. We further verify that the derived topic values are consistent with human assessor subjective evaluation. For the second question, we identify and evaluate a set of significant factors including both content and social network factors. In particular, the social network factors are better in filtering out low-value topics, while content factors are more effective in selecting a few top high-value topics. In addition, we demonstrate that a Support Vector regression model that combines the factors can already effectively find valuable topics. We believe that our results provide significant insights towards scientific advances to find valuable information.
منابع مشابه
Expert Stance Graphs for Computational Argumentation
We describe the construction of an Expert Stance Graph, a novel, large-scale knowledge resource that encodes the stance of more than 100,000 experts towards a variety of controversial topics. We suggest that this graph may be valuable for various fundamental tasks in computational argumentation. Experts and topics in our graph are Wikipedia entries. Both automatic and semi-automatic methods for...
متن کاملWeka machine learning for predicting the phospholipidosis inducing potential.
The drug discovery and development process is lengthy and expensive, and bringing a drug to market may take up to 18 years and may cost up to 2 billion $US. The extensive use of computer-assisted drug design techniques may considerably increase the chances of finding valuable drug candidates, thus decreasing the drug discovery time and costs. The most important computational approach is represe...
متن کاملEarly Identification of Personalized Trending Topics in Microblogging
Social media has become a primary platform for the spread of information. Trending topics, which are breaking news and immediately popular stories, have become an attractive data source facilitating the spread of emerging issues. Motivated by the diverse trending topics covering from sports to politics, it is essential to help users find personalized trending topics. Since a topic in social med...
متن کاملAttitude of nurses, instructors and nursing students towards the care of elderly patients(A systematic review)
Introduction: Nurses, as nursing care providers, have a pivotal role and a unique role in effecting the quality of care. The attitude of nurses on their preference for working with elderly people and the quality of care provided to them is effective. This study aimed to investigate the attitude of nursing students and nurses towards the care of elderly people. Method: To obtain articles on th...
متن کاملBovine Tuberculosis - Towards a Science Based Control Strategy
In order that future TB control policies can be science based, Defra on ISG advice has put in place a comprehensive programme of research to better understand the epidemiology of TB in both cattle and badgers.1,2,3 This research is now providing a flow of valuable data on a range of topics, extending beyond the trial on culling badgers that was originally proposed in the Krebs report.4 Emerging...
متن کامل